Overview

Dataset statistics

Number of variables: 27
Number of observations: 165
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 34.9 KiB
Average record size in memory: 216.8 B

Variable types

NUM: 25
BOOL: 2

Reproduction

Analysis started: 2020-07-01 15:55:34.039670
Analysis finished: 2020-07-01 15:57:15.917615
Duration: 1 minute and 41.88 seconds
Version: pandas-profiling v2.8.0
Command line: pandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configuration: config.yaml

Warnings

songs_played is highly correlated with page_count and 5 other fields [High correlation]
page_count is highly correlated with songs_played and 5 other fields [High correlation]
songs_playlisted is highly correlated with page_count and 5 other fields [High correlation]
thumbed_up is highly correlated with page_count and 4 other fields [High correlation]
added_friends is highly correlated with page_count and 4 other fields [High correlation]
error_count is highly correlated with page_count and 5 other fields [High correlation]
redirect_count is highly correlated with page_count and 5 other fields [High correlation]
7d_songs_playlisted is highly correlated with 7d_songs_played [High correlation]
7d_songs_played is highly correlated with 7d_songs_playlisted and 1 other field [High correlation]
7d_thumbed_up is highly correlated with 7d_songs_played [High correlation]
interactions_per_day is highly correlated with songs_per_day [High correlation]
songs_per_day is highly correlated with interactions_per_day [High correlation]
inter_per_session is highly correlated with time_mean [High correlation]
time_mean is highly correlated with inter_per_session [High correlation]
df_index has unique values [Unique]
thumbed_up_ratio has unique values [Unique]
songs_per_day has unique values [Unique]
interactions_per_day has unique values [Unique]
thumbed_down has 4 (2.4%) zeros [Zeros]
7d_songs_playlisted has 11 (6.7%) zeros [Zeros]
7d_thumbed_up has 3 (1.8%) zeros [Zeros]
7d_thumbed_down has 32 (19.4%) zeros [Zeros]
7d_added_friends has 27 (16.4%) zeros [Zeros]
thumbed_down_ratio has 4 (2.4%) zeros [Zeros]
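
The percentages in the Zeros warnings are the zero counts divided by the number of observations (165). The following is a minimal sketch that recomputes them from the counts listed above; the names `N_OBSERVATIONS`, `zero_counts` and `zero_share` are illustrative and not part of pandas-profiling.

```python
# Zero counts copied from the Zeros warnings in this report.
N_OBSERVATIONS = 165

zero_counts = {
    "thumbed_down": 4,
    "7d_songs_playlisted": 11,
    "7d_thumbed_up": 3,
    "7d_thumbed_down": 32,
    "7d_added_friends": 27,
    "thumbed_down_ratio": 4,
}


def zero_share(count, n_observations=N_OBSERVATIONS):
    """Return the share of zeros as a percentage rounded to one decimal."""
    return round(100 * count / n_observations, 1)


zero_pct = {name: zero_share(count) for name, count in zero_counts.items()}

for name, pct in zero_pct.items():
    print(f"{name}: {zero_counts[name]} zeros ({pct}%)")
```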

Variables

df_index
Real number (ℝ≥0)

UNIQUE

Distinct count: 165
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 117.14545454545454
Minimum: 0
Maximum: 224
Zeros: 1
Zeros (%): 0.6%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 14.2
Q1: 64
median: 123
Q3: 169
95-th percentile: 213.8
Maximum: 224
Range: 224
Interquartile range (IQR): 105

Descriptive statistics

Standard deviation: 63.53751096
Coefficient of variation (CV): 0.5423813601
Kurtosis: -1.110552042
Mean: 117.1454545
Median Absolute Deviation (MAD): 53
Skewness: -0.1125336177
Sum: 19329
Variance: 4037.015299
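
Several of the statistics above are simple functions of the others: the mean is the sum divided by the number of observations, the coefficient of variation is the standard deviation divided by the mean, the variance is the squared standard deviation, and the range and IQR are differences of quantiles. The sketch below checks these relationships using the df_index values reported above; the variable names are illustrative only.

```python
# Values copied from the df_index block of this report.
n_observations = 165
total = 19329
mean = 117.1454545
std = 63.53751096
q1, q3 = 64, 169
minimum, maximum = 0, 224

derived = {
    "mean": total / n_observations,  # Sum / number of observations
    "cv": std / mean,                # coefficient of variation
    "variance": std ** 2,            # squared standard deviation
    "iqr": q3 - q1,                  # interquartile range
    "range": maximum - minimum,
}

for name, value in derived.items():
    print(f"{name}: {value}")
```

The derived values agree with the reported Mean, CV, Variance, IQR and Range up to the rounding of the inputs.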
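[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
224 | 1 | 0.6%
91 | 1 | 0.6%
89 | 1 | 0.6%
88 | 1 | 0.6%
87 | 1 | 0.6%
86 | 1 | 0.6%
84 | 1 | 0.6%
82 | 1 | 0.6%
81 | 1 | 0.6%
80 | 1 | 0.6%
Other values (155) | 155 | 93.9%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.6%
1 | 1 | 0.6%
3 | 1 | 0.6%
5 | 1 | 0.6%
8 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
224 | 1 | 0.6%
223 | 1 | 0.6%
222 | 1 | 0.6%
221 | 1 | 0.6%
220 | 1 | 0.6%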

page_count
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 158
Unique (%): 95.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1604.121212121212
Minimum: 62
Maximum: 9632
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 62
5-th percentile: 225.2
Q1: 671
median: 1310
Q3: 2125
95-th percentile: 4159.4
Maximum: 9632
Range: 9570
Interquartile range (IQR): 1454

Descriptive statistics

Standard deviation: 1374.657406
Coefficient of variation (CV): 0.8569535744
Kurtosis: 9.003218665
Mean: 1604.121212
Median Absolute Deviation (MAD): 710
Skewness: 2.406850391
Sum: 264680
Variance: 1889682.985

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
1102 | 2 | 1.2%
848 | 2 | 1.2%
1682 | 2 | 1.2%
1775 | 2 | 1.2%
457 | 2 | 1.2%
1322 | 2 | 1.2%
1288 | 2 | 1.2%
9632 | 1 | 0.6%
2132 | 1 | 0.6%
1363 | 1 | 0.6%
Other values (148) | 148 | 89.7%

Minimum 5 values

Value | Count | Frequency (%)
62 | 1 | 0.6%
76 | 1 | 0.6%
102 | 1 | 0.6%
108 | 1 | 0.6%
143 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
9632 | 1 | 0.6%
7230 | 1 | 0.6%
6880 | 1 | 0.6%
5732 | 1 | 0.6%
4825 | 1 | 0.6%

songs_played
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 159
Unique (%): 96.4%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1321.2424242424242
Minimum: 41.0
Maximum: 8002.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 41
5-th percentile: 196.8
Q1: 530
median: 1073
Q3: 1746
95-th percentile: 3490.2
Maximum: 8002
Range: 7961
Interquartile range (IQR): 1216

Descriptive statistics

Standard deviation: 1141.558163
Coefficient of variation (CV): 0.8640035633
Kurtosis: 8.890345852
Mean: 1321.242424
Median Absolute Deviation (MAD): 591
Skewness: 2.381399645
Sum: 218005
Variance: 1303155.038

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
1797 | 2 | 1.2%
650 | 2 | 1.2%
377 | 2 | 1.2%
429 | 2 | 1.2%
1610 | 2 | 1.2%
1694 | 2 | 1.2%
214 | 1 | 0.6%
204 | 1 | 0.6%
312 | 1 | 0.6%
279 | 1 | 0.6%
Other values (149) | 149 | 90.3%

Minimum 5 values

Value | Count | Frequency (%)
41 | 1 | 0.6%
65 | 1 | 0.6%
80 | 1 | 0.6%
88 | 1 | 0.6%
111 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
8002 | 1 | 0.6%
5945 | 1 | 0.6%
5664 | 1 | 0.6%
4619 | 1 | 0.6%
4079 | 1 | 0.6%

songs_playlisted
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 75
Unique (%): 45.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 37.93333333333333
Minimum: 0.0
Maximum: 240.0
Zeros: 1
Zeros (%): 0.6%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 5
Q1: 13
median: 30
Q3: 52
95-th percentile: 101
Maximum: 240
Range: 240
Interquartile range (IQR): 39

Descriptive statistics

Standard deviation: 33.98836482
Coefficient of variation (CV): 0.8960025875
Kurtosis: 9.167947646
Mean: 37.93333333
Median Absolute Deviation (MAD): 19
Skewness: 2.374321392
Sum: 6259
Variance: 1155.208943

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
12 | 7 | 4.2%
9 | 6 | 3.6%
61 | 5 | 3.0%
30 | 5 | 3.0%
50 | 5 | 3.0%
11 | 5 | 3.0%
20 | 4 | 2.4%
58 | 4 | 2.4%
7 | 4 | 2.4%
27 | 4 | 2.4%
Other values (65) | 116 | 70.3%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.6%
1 | 1 | 0.6%
2 | 2 | 1.2%
3 | 1 | 0.6%
4 | 3 | 1.8%

Maximum 5 values

Value | Count | Frequency (%)
240 | 1 | 0.6%
181 | 1 | 0.6%
148 | 1 | 0.6%
146 | 1 | 0.6%
118 | 1 | 0.6%

thumbed_up
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 105
Unique (%): 63.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 72.8
Minimum: 2.0
Maximum: 437.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 2
5-th percentile: 7.2
Q1: 28
median: 54
Q3: 93
95-th percentile: 170.6
Maximum: 437
Range: 435
Interquartile range (IQR): 65

Descriptive statistics

Standard deviation: 68.8219867
Coefficient of variation (CV): 0.9453569602
Kurtosis: 8.43677352
Mean: 72.8
Median Absolute Deviation (MAD): 30
Skewness: 2.494714978
Sum: 12012
Variance: 4736.465854

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
37 | 4 | 2.4%
52 | 4 | 2.4%
27 | 4 | 2.4%
35 | 4 | 2.4%
28 | 4 | 2.4%
111 | 3 | 1.8%
54 | 3 | 1.8%
7 | 3 | 1.8%
81 | 3 | 1.8%
42 | 3 | 1.8%
Other values (95) | 130 | 78.8%

Minimum 5 values

Value | Count | Frequency (%)
2 | 1 | 0.6%
3 | 1 | 0.6%
4 | 1 | 0.6%
5 | 2 | 1.2%
6 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
437 | 1 | 0.6%
388 | 1 | 0.6%
336 | 1 | 0.6%
303 | 1 | 0.6%
292 | 1 | 0.6%

thumbed_down
Real number (ℝ≥0)

ZEROS

Distinct count: 40
Unique (%): 24.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 14.460606060606061
Minimum: 0.0
Maximum: 75.0
Zeros: 4
Zeros (%): 2.4%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 5
median: 11
Q3: 20
95-th percentile: 37.6
Maximum: 75
Range: 75
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 13.7917866
Coefficient of variation (CV): 0.9537488639
Kurtosis: 6.194060307
Mean: 14.46060606
Median Absolute Deviation (MAD): 6
Skewness: 2.181310639
Sum: 2386
Variance: 190.2133777

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
6 | 10 | 6.1%
9 | 10 | 6.1%
4 | 9 | 5.5%
1 | 8 | 4.8%
8 | 8 | 4.8%
5 | 8 | 4.8%
2 | 8 | 4.8%
17 | 8 | 4.8%
12 | 7 | 4.2%
11 | 6 | 3.6%
Other values (30) | 83 | 50.3%

Minimum 5 values

Value | Count | Frequency (%)
0 | 4 | 2.4%
1 | 8 | 4.8%
2 | 8 | 4.8%
3 | 6 | 3.6%
4 | 9 | 5.5%

Maximum 5 values

Value | Count | Frequency (%)
75 | 1 | 0.6%
73 | 1 | 0.6%
72 | 1 | 0.6%
69 | 1 | 0.6%
54 | 1 | 0.6%

added_friends
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 57
Unique (%): 34.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 24.636363636363637
Minimum: 0.0
Maximum: 143.0
Zeros: 1
Zeros (%): 0.6%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 3
Q1: 11
median: 20
Q3: 31
95-th percentile: 59.6
Maximum: 143
Range: 143
Interquartile range (IQR): 20

Descriptive statistics

Standard deviation: 21.32160559
Coefficient of variation (CV): 0.8654526252
Kurtosis: 9.250856078
Mean: 24.63636364
Median Absolute Deviation (MAD): 9
Skewness: 2.524430657
Sum: 4065
Variance: 454.6108647

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
8 | 10 | 6.1%
28 | 6 | 3.6%
17 | 6 | 3.6%
23 | 6 | 3.6%
13 | 6 | 3.6%
12 | 5 | 3.0%
21 | 5 | 3.0%
3 | 5 | 3.0%
11 | 5 | 3.0%
14 | 5 | 3.0%
Other values (47) | 106 | 64.2%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.6%
1 | 4 | 2.4%
2 | 3 | 1.8%
3 | 5 | 3.0%
4 | 3 | 1.8%

Maximum 5 values

Value | Count | Frequency (%)
143 | 1 | 0.6%
122 | 1 | 0.6%
110 | 1 | 0.6%
93 | 1 | 0.6%
89 | 1 | 0.6%

top_location
Real number (ℝ≥0)

Distinct count: 95
Unique (%): 57.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 56.21818181818182
Minimum: 1
Maximum: 113
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 6.2
Q1: 30
median: 56
Q3: 79
95-th percentile: 105.8
Maximum: 113
Range: 112
Interquartile range (IQR): 49

Descriptive statistics

Standard deviation: 31.44256722
Coefficient of variation (CV): 0.5592953418
Kurtosis: -1.065978092
Mean: 56.21818182
Median Absolute Deviation (MAD): 24
Skewness: -0.007466208193
Sum: 9276
Variance: 988.6350333

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
56 | 14 | 8.5%
72 | 12 | 7.3%
18 | 5 | 3.0%
17 | 5 | 3.0%
79 | 5 | 3.0%
5 | 4 | 2.4%
101 | 4 | 2.4%
78 | 4 | 2.4%
12 | 4 | 2.4%
26 | 3 | 1.8%
Other values (85) | 105 | 63.6%

Minimum 5 values

Value | Count | Frequency (%)
1 | 1 | 0.6%
2 | 1 | 0.6%
3 | 1 | 0.6%
4 | 1 | 0.6%
5 | 4 | 2.4%

Maximum 5 values

Value | Count | Frequency (%)
113 | 1 | 0.6%
112 | 1 | 0.6%
111 | 3 | 1.8%
109 | 1 | 0.6%
108 | 1 | 0.6%

user_gender
Boolean

Distinct count: 2
Unique (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count | Frequency (%)
1 | 86 | 52.1%
0 | 79 | 47.9%

error_count
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 158
Unique (%): 95.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1604.121212121212
Minimum: 62
Maximum: 9632
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 62
5-th percentile: 225.2
Q1: 671
median: 1310
Q3: 2125
95-th percentile: 4159.4
Maximum: 9632
Range: 9570
Interquartile range (IQR): 1454

Descriptive statistics

Standard deviation: 1374.657406
Coefficient of variation (CV): 0.8569535744
Kurtosis: 9.003218665
Mean: 1604.121212
Median Absolute Deviation (MAD): 710
Skewness: 2.406850391
Sum: 264680
Variance: 1889682.985

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
1102 | 2 | 1.2%
848 | 2 | 1.2%
1682 | 2 | 1.2%
1775 | 2 | 1.2%
457 | 2 | 1.2%
1322 | 2 | 1.2%
1288 | 2 | 1.2%
9632 | 1 | 0.6%
2132 | 1 | 0.6%
1363 | 1 | 0.6%
Other values (148) | 148 | 89.7%

Minimum 5 values

Value | Count | Frequency (%)
62 | 1 | 0.6%
76 | 1 | 0.6%
102 | 1 | 0.6%
108 | 1 | 0.6%
143 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
9632 | 1 | 0.6%
7230 | 1 | 0.6%
6880 | 1 | 0.6%
5732 | 1 | 0.6%
4825 | 1 | 0.6%

redirect_count
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 158
Unique (%): 95.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1604.121212121212
Minimum: 62
Maximum: 9632
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 62
5-th percentile: 225.2
Q1: 671
median: 1310
Q3: 2125
95-th percentile: 4159.4
Maximum: 9632
Range: 9570
Interquartile range (IQR): 1454

Descriptive statistics

Standard deviation: 1374.657406
Coefficient of variation (CV): 0.8569535744
Kurtosis: 9.003218665
Mean: 1604.121212
Median Absolute Deviation (MAD): 710
Skewness: 2.406850391
Sum: 264680
Variance: 1889682.985

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
1102 | 2 | 1.2%
848 | 2 | 1.2%
1682 | 2 | 1.2%
1775 | 2 | 1.2%
457 | 2 | 1.2%
1322 | 2 | 1.2%
1288 | 2 | 1.2%
9632 | 1 | 0.6%
2132 | 1 | 0.6%
1363 | 1 | 0.6%
Other values (148) | 148 | 89.7%

Minimum 5 values

Value | Count | Frequency (%)
62 | 1 | 0.6%
76 | 1 | 0.6%
102 | 1 | 0.6%
108 | 1 | 0.6%
143 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
9632 | 1 | 0.6%
7230 | 1 | 0.6%
6880 | 1 | 0.6%
5732 | 1 | 0.6%
4825 | 1 | 0.6%

days_active
Real number (ℝ≥0)

Distinct count: 92
Unique (%): 55.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 83.18787878787879
Minimum: 11
Maximum: 256
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 11
5-th percentile: 27.2
Q1: 62
median: 75
Q3: 108
95-th percentile: 151.6
Maximum: 256
Range: 245
Interquartile range (IQR): 46

Descriptive statistics

Standard deviation: 38.2709512
Coefficient of variation (CV): 0.4600544185
Kurtosis: 1.931411572
Mean: 83.18787879
Median Absolute Deviation (MAD): 19
Skewness: 0.9476913839
Sum: 13726
Variance: 1464.665706

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
75 | 8 | 4.8%
71 | 7 | 4.2%
63 | 5 | 3.0%
62 | 4 | 2.4%
76 | 4 | 2.4%
67 | 4 | 2.4%
68 | 4 | 2.4%
110 | 3 | 1.8%
102 | 3 | 1.8%
124 | 3 | 1.8%
Other values (82) | 120 | 72.7%

Minimum 5 values

Value | Count | Frequency (%)
11 | 1 | 0.6%
13 | 1 | 0.6%
16 | 1 | 0.6%
19 | 1 | 0.6%
21 | 2 | 1.2%

Maximum 5 values

Value | Count | Frequency (%)
256 | 1 | 0.6%
188 | 1 | 0.6%
172 | 1 | 0.6%
171 | 1 | 0.6%
168 | 1 | 0.6%

7d_songs_played
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 148
Unique (%): 89.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 279.8424242424242
Minimum: 1.0
Maximum: 959.0
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 32.2
Q1: 111
median: 235
Q3: 390
95-th percentile: 701.8
Maximum: 959
Range: 958
Interquartile range (IQR): 279

Descriptive statistics

Standard deviation: 218.0377275
Coefficient of variation (CV): 0.7791446494
Kurtosis: 0.8456860898
Mean: 279.8424242
Median Absolute Deviation (MAD): 137
Skewness: 1.069614434
Sum: 46174
Variance: 47540.45063

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
33 | 3 | 1.8%
48 | 2 | 1.2%
121 | 2 | 1.2%
32 | 2 | 1.2%
269 | 2 | 1.2%
44 | 2 | 1.2%
361 | 2 | 1.2%
333 | 2 | 1.2%
38 | 2 | 1.2%
330 | 2 | 1.2%
Other values (138) | 144 | 87.3%

Minimum 5 values

Value | Count | Frequency (%)
1 | 2 | 1.2%
8 | 1 | 0.6%
10 | 1 | 0.6%
13 | 1 | 0.6%
15 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
959 | 1 | 0.6%
948 | 1 | 0.6%
896 | 1 | 0.6%
891 | 1 | 0.6%
880 | 1 | 0.6%

7d_songs_playlisted
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count: 29
Unique (%): 17.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.751515151515152
Minimum: 0.0
Maximum: 33.0
Zeros: 11
Zeros (%): 6.7%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 2
median: 6
Q3: 11
95-th percentile: 22.6
Maximum: 33
Range: 33
Interquartile range (IQR): 9

Descriptive statistics

Standard deviation: 6.938234523
Coefficient of variation (CV): 0.8950810761
Kurtosis: 1.798577324
Mean: 7.751515152
Median Absolute Deviation (MAD): 4
Skewness: 1.376526639
Sum: 1279
Variance: 48.1390983

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
2 | 19 | 11.5%
4 | 15 | 9.1%
1 | 13 | 7.9%
9 | 13 | 7.9%
0 | 11 | 6.7%
3 | 11 | 6.7%
7 | 10 | 6.1%
6 | 10 | 6.1%
8 | 8 | 4.8%
12 | 8 | 4.8%
Other values (19) | 47 | 28.5%

Minimum 5 values

Value | Count | Frequency (%)
0 | 11 | 6.7%
1 | 13 | 7.9%
2 | 19 | 11.5%
3 | 11 | 6.7%
4 | 15 | 9.1%

Maximum 5 values

Value | Count | Frequency (%)
33 | 1 | 0.6%
31 | 1 | 0.6%
30 | 1 | 0.6%
26 | 3 | 1.8%
25 | 1 | 0.6%

7d_thumbed_up
Real number (ℝ≥0)

HIGH CORRELATION
ZEROS

Distinct count: 45
Unique (%): 27.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 15.351515151515152
Minimum: 0.0
Maximum: 56.0
Zeros: 3
Zeros (%): 1.8%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 1
Q1: 5
median: 12
Q3: 20
95-th percentile: 41.8
Maximum: 56
Range: 56
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 12.53424872
Coefficient of variation (CV): 0.816482842
Kurtosis: 0.799669563
Mean: 15.35151515
Median Absolute Deviation (MAD): 7
Skewness: 1.11769071
Sum: 2533
Variance: 157.107391

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
5 | 12 | 7.3%
1 | 11 | 6.7%
4 | 10 | 6.1%
9 | 9 | 5.5%
6 | 9 | 5.5%
20 | 8 | 4.8%
13 | 7 | 4.2%
12 | 7 | 4.2%
14 | 6 | 3.6%
11 | 6 | 3.6%
Other values (35) | 80 | 48.5%

Minimum 5 values

Value | Count | Frequency (%)
0 | 3 | 1.8%
1 | 11 | 6.7%
2 | 4 | 2.4%
3 | 4 | 2.4%
4 | 10 | 6.1%

Maximum 5 values

Value | Count | Frequency (%)
56 | 1 | 0.6%
52 | 1 | 0.6%
50 | 2 | 1.2%
48 | 1 | 0.6%
47 | 1 | 0.6%

7d_thumbed_down
Real number (ℝ≥0)

ZEROS

Distinct count: 17
Unique (%): 10.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 3.187878787878788
Minimum: 0.0
Maximum: 17.0
Zeros: 32
Zeros (%): 19.4%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
median: 2
Q3: 5
95-th percentile: 9.8
Maximum: 17
Range: 17
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 3.325048833
Coefficient of variation (CV): 1.043028626
Kurtosis: 3.185389525
Mean: 3.187878788
Median Absolute Deviation (MAD): 2
Skewness: 1.646781354
Sum: 526
Variance: 11.05594974

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
1 | 35 | 21.2%
0 | 32 | 19.4%
3 | 22 | 13.3%
2 | 20 | 12.1%
5 | 14 | 8.5%
6 | 11 | 6.7%
4 | 10 | 6.1%
8 | 5 | 3.0%
7 | 5 | 3.0%
9 | 2 | 1.2%
Other values (7) | 9 | 5.5%

Minimum 5 values

Value | Count | Frequency (%)
0 | 32 | 19.4%
1 | 35 | 21.2%
2 | 20 | 12.1%
3 | 22 | 13.3%
4 | 10 | 6.1%

Maximum 5 values

Value | Count | Frequency (%)
17 | 1 | 0.6%
16 | 1 | 0.6%
14 | 1 | 0.6%
13 | 2 | 1.2%
12 | 1 | 0.6%

7d_added_friends
Real number (ℝ≥0)

ZEROS

Distinct count: 20
Unique (%): 12.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.387878787878788
Minimum: 0.0
Maximum: 28.0
Zeros: 27
Zeros (%): 16.4%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 1
median: 4
Q3: 8
95-th percentile: 14.8
Maximum: 28
Range: 28
Interquartile range (IQR): 7

Descriptive statistics

Standard deviation: 5.097330577
Coefficient of variation (CV): 0.9460737291
Kurtosis: 2.517558508
Mean: 5.387878788
Median Absolute Deviation (MAD): 3
Skewness: 1.336787808
Sum: 889
Variance: 25.98277901

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
0 | 27 | 16.4%
1 | 20 | 12.1%
3 | 17 | 10.3%
7 | 13 | 7.9%
8 | 12 | 7.3%
2 | 12 | 7.3%
5 | 11 | 6.7%
4 | 10 | 6.1%
6 | 7 | 4.2%
12 | 6 | 3.6%
Other values (10) | 30 | 18.2%

Minimum 5 values

Value | Count | Frequency (%)
0 | 27 | 16.4%
1 | 20 | 12.1%
2 | 12 | 7.3%
3 | 17 | 10.3%
4 | 10 | 6.1%

Maximum 5 values

Value | Count | Frequency (%)
28 | 1 | 0.6%
24 | 1 | 0.6%
22 | 1 | 0.6%
16 | 2 | 1.2%
15 | 4 | 2.4%

playlisted_ratio
Real number (ℝ≥0)

Distinct count: 162
Unique (%): 98.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.02820802409884897
Minimum: 0.0
Maximum: 0.08641975308641975
Zeros: 1
Zeros (%): 0.6%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0.01806598778
Q1: 0.0243902439
median: 0.02810650888
Q3: 0.03196347032
95-th percentile: 0.03675886194
Maximum: 0.08641975309
Range: 0.08641975309
Interquartile range (IQR): 0.007573226417

Descriptive statistics

Standard deviation: 0.007633865906
Coefficient of variation (CV): 0.2706274597
Kurtosis: 20.56221127
Mean: 0.0282080241
Median Absolute Deviation (MAD): 0.003856961444
Skewness: 2.318122225
Sum: 4.654323976
Variance: 5.827590867e-05

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
0.02380952381 | 3 | 1.8%
0.03571428571 | 2 | 1.2%
0.0241301908 | 1 | 0.6%
0.03035878565 | 1 | 0.6%
0.03004291845 | 1 | 0.6%
0.03403141361 | 1 | 0.6%
0.025 | 1 | 0.6%
0.02194656489 | 1 | 0.6%
0.03366696997 | 1 | 0.6%
0.02312138728 | 1 | 0.6%
Other values (152) | 152 | 92.1%

Minimum 5 values

Value | Count | Frequency (%)
0 | 1 | 0.6%
0.01129943503 | 1 | 0.6%
0.01335311573 | 1 | 0.6%
0.01438848921 | 1 | 0.6%
0.01463414634 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
0.08641975309 | 1 | 0.6%
0.04452054795 | 1 | 0.6%
0.04115226337 | 1 | 0.6%
0.03972687772 | 1 | 0.6%
0.03942857143 | 1 | 0.6%

thumbed_up_ratio
Real number (ℝ≥0)

UNIQUE

Distinct count: 165
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.05488629303398286
Minimum: 0.02108433734939759
Maximum: 0.1130952380952381
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0.02108433735
5-th percentile: 0.03496924486
Q1: 0.04572907679
median: 0.05119047619
Q3: 0.05862831858
95-th percentile: 0.09368205569
Maximum: 0.1130952381
Range: 0.09201090075
Interquartile range (IQR): 0.01289924179

Descriptive statistics

Standard deviation: 0.0168197903
Coefficient of variation (CV): 0.3064479193
Kurtosis: 2.127315874
Mean: 0.05488629303
Median Absolute Deviation (MAD): 0.00625633232
Skewness: 1.347780081
Sum: 9.056238351
Variance: 0.0002829053458

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
0.03703703704 | 1 | 0.6%
0.04759493671 | 1 | 0.6%
0.05 | 1 | 0.6%
0.07962697274 | 1 | 0.6%
0.04848181932 | 1 | 0.6%
0.03203342618 | 1 | 0.6%
0.05170239596 | 1 | 0.6%
0.06068268015 | 1 | 0.6%
0.04600638978 | 1 | 0.6%
0.05545722714 | 1 | 0.6%
Other values (155) | 155 | 93.9%

Minimum 5 values

Value | Count | Frequency (%)
0.02108433735 | 1 | 0.6%
0.02304147465 | 1 | 0.6%
0.02450980392 | 1 | 0.6%
0.02551020408 | 1 | 0.6%
0.02877697842 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
0.1130952381 | 1 | 0.6%
0.1118210863 | 1 | 0.6%
0.1070878274 | 1 | 0.6%
0.1038798498 | 1 | 0.6%
0.09821428571 | 1 | 0.6%

thumbed_down_ratio
Real number (ℝ≥0)

ZEROS

Distinct count: 158
Unique (%): 95.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.011303402104294612
Minimum: 0.0
Maximum: 0.0374331550802139
Zeros: 4
Zeros (%): 2.4%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0.004023045267
Q1: 0.007310704961
median: 0.009463722397
Q3: 0.0124137931
95-th percentile: 0.0310765962
Maximum: 0.03743315508
Range: 0.03743315508
Interquartile range (IQR): 0.005103088143

Descriptive statistics

Standard deviation: 0.007302483765
Coefficient of variation (CV): 0.6460429964
Kurtosis: 3.769201087
Mean: 0.0113034021
Median Absolute Deviation (MAD): 0.002404898868
Skewness: 1.937214341
Sum: 1.865061347
Variance: 5.332626914e-05

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
0 | 4 | 2.4%
0.005952380952 | 2 | 1.2%
0.008898776418 | 2 | 1.2%
0.007936507937 | 2 | 1.2%
0.01277955272 | 2 | 1.2%
0.01308900524 | 1 | 0.6%
0.008090614887 | 1 | 0.6%
0.009748172218 | 1 | 0.6%
0.01147776184 | 1 | 0.6%
0.01006336191 | 1 | 0.6%
Other values (148) | 148 | 89.7%

Minimum 5 values

Value | Count | Frequency (%)
0 | 4 | 2.4%
0.002325581395 | 1 | 0.6%
0.00365408039 | 1 | 0.6%
0.003816793893 | 1 | 0.6%
0.00390625 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
0.03743315508 | 1 | 0.6%
0.03686635945 | 1 | 0.6%
0.03427419355 | 1 | 0.6%
0.03413400759 | 1 | 0.6%
0.03324468085 | 1 | 0.6%

songs_per_day
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct count: 165
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 18.058769331860425
Minimum: 0.7735849056603774
Maximum: 105.55
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0.7735849057
5-th percentile: 2.668035692
Q1: 7.54
median: 12.33333333
Q3: 23.40789474
95-th percentile: 49.39680874
Maximum: 105.55
Range: 104.7764151
Interquartile range (IQR): 15.86789474

Descriptive statistics

Standard deviation: 16.46408399
Coefficient of variation (CV): 0.9116946835
Kurtosis: 5.854022416
Mean: 18.05876933
Median Absolute Deviation (MAD): 7.137254902
Skewness: 2.079348662
Sum: 2979.69694
Variance: 271.0660616

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
17.94444444 | 1 | 0.6%
12.78225806 | 1 | 0.6%
11.90066225 | 1 | 0.6%
32.87272727 | 1 | 0.6%
11.76470588 | 1 | 0.6%
8.293650794 | 1 | 0.6%
8.409638554 | 1 | 0.6%
19.47058824 | 1 | 0.6%
30.90151515 | 1 | 0.6%
2.256410256 | 1 | 0.6%
Other values (155) | 155 | 93.9%

Minimum 5 values

Value | Count | Frequency (%)
0.7735849057 | 1 | 0.6%
1.203703704 | 1 | 0.6%
1.211180124 | 1 | 0.6%
1.219298246 | 1 | 0.6%
1.241935484 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
105.55 | 1 | 0.6%
74.5 | 1 | 0.6%
70.77380952 | 1 | 0.6%
69.92592593 | 1 | 0.6%
66.21428571 | 1 | 0.6%

interactions_per_day
Real number (ℝ≥0)

HIGH CORRELATION
UNIQUE

Distinct count: 165
Unique (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 21.903938194055314
Minimum: 1.169811320754717
Maximum: 123.2
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 1.169811321
5-th percentile: 3.22349346
Q1: 9.14
median: 15.28225806
Q3: 28.14102564
95-th percentile: 59.06045902
Maximum: 123.2
Range: 122.0301887
Interquartile range (IQR): 19.00102564

Descriptive statistics

Standard deviation: 19.71432324
Coefficient of variation (CV): 0.9000355584
Kurtosis: 5.451594162
Mean: 21.90393819
Median Absolute Deviation (MAD): 8.498258065
Skewness: 2.039149221
Sum: 3614.149802
Variance: 388.654541

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
21.48611111 | 1 | 0.6%
11.21052632 | 1 | 0.6%
19.03448276 | 1 | 0.6%
30.6 | 1 | 0.6%
14.27906977 | 1 | 0.6%
36.5530303 | 1 | 0.6%
15.28225806 | 1 | 0.6%
12.15753425 | 1 | 0.6%
10.10185185 | 1 | 0.6%
49.23809524 | 1 | 0.6%
Other values (155) | 155 | 93.9%

Minimum 5 values

Value | Count | Frequency (%)
1.169811321 | 1 | 0.6%
1.354037267 | 1 | 0.6%
1.407407407 | 1 | 0.6%
1.526315789 | 1 | 0.6%
1.685483871 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
123.2 | 1 | 0.6%
92.4516129 | 1 | 0.6%
86.07142857 | 1 | 0.6%
84.9382716 | 1 | 0.6%
78.71428571 | 1 | 0.6%

time_mean
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 135
Unique (%): 81.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 346.09090909090907
Minimum: 37
Maximum: 1179
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 37
5-th percentile: 142
Q1: 228
median: 321
Q3: 431
95-th percentile: 640.8
Maximum: 1179
Range: 1142
Interquartile range (IQR): 203

Descriptive statistics

Standard deviation: 166.8964853
Coefficient of variation (CV): 0.4822330807
Kurtosis: 4.161622197
Mean: 346.0909091
Median Absolute Deviation (MAD): 98
Skewness: 1.513614716
Sum: 57105
Variance: 27854.43681

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
432 | 4 | 2.4%
211 | 4 | 2.4%
329 | 3 | 1.8%
232 | 3 | 1.8%
381 | 3 | 1.8%
370 | 2 | 1.2%
216 | 2 | 1.2%
457 | 2 | 1.2%
326 | 2 | 1.2%
339 | 2 | 1.2%
Other values (125) | 138 | 83.6%

Minimum 5 values

Value | Count | Frequency (%)
37 | 1 | 0.6%
87 | 1 | 0.6%
89 | 1 | 0.6%
108 | 1 | 0.6%
125 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
1179 | 1 | 0.6%
904 | 1 | 0.6%
871 | 1 | 0.6%
817 | 1 | 0.6%
816 | 1 | 0.6%

session_count
Real number (ℝ≥0)

Distinct count: 45
Unique (%): 27.3%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 17.175757575757576
Minimum: 1
Maximum: 107
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 3
Q1: 8
median: 13
Q3: 21
95-th percentile: 40.4
Maximum: 107
Range: 106
Interquartile range (IQR): 13

Descriptive statistics

Standard deviation: 15.82023693
Coefficient of variation (CV): 0.9210794261
Kurtosis: 10.11839628
Mean: 17.17575758
Median Absolute Deviation (MAD): 6
Skewness: 2.785708026
Sum: 2834
Variance: 250.2798965

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
6 | 11 | 6.7%
10 | 11 | 6.7%
15 | 9 | 5.5%
7 | 9 | 5.5%
12 | 8 | 4.8%
11 | 7 | 4.2%
4 | 7 | 4.2%
8 | 7 | 4.2%
9 | 7 | 4.2%
3 | 6 | 3.6%
Other values (35) | 83 | 50.3%

Minimum 5 values

Value | Count | Frequency (%)
1 | 4 | 2.4%
3 | 6 | 3.6%
4 | 7 | 4.2%
5 | 4 | 2.4%
6 | 11 | 6.7%

Maximum 5 values

Value | Count | Frequency (%)
107 | 1 | 0.6%
86 | 1 | 0.6%
76 | 2 | 1.2%
71 | 1 | 0.6%
63 | 1 | 0.6%

inter_per_session
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count: 161
Unique (%): 97.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 100.98616922792375
Minimum: 15.5
Maximum: 334.8888888888889
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 15.5
5-th percentile: 42.59090909
Q1: 69.95
median: 92.88888889
Q3: 119.625
95-th percentile: 181.0133333
Maximum: 334.8888889
Range: 319.3888889
Interquartile range (IQR): 49.675

Descriptive statistics

Standard deviation: 46.33954567
Coefficient of variation (CV): 0.4588702198
Kurtosis: 4.342145789
Mean: 100.9861692
Median Absolute Deviation (MAD): 24.97222222
Skewness: 1.53284204
Sum: 16662.71792
Variance: 2147.353493

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
51.66666667 | 2 | 1.2%
91 | 2 | 1.2%
61.33333333 | 2 | 1.2%
111 | 2 | 1.2%
156.6666667 | 1 | 0.6%
36 | 1 | 0.6%
125.6666667 | 1 | 0.6%
56.53846154 | 1 | 0.6%
77.09090909 | 1 | 0.6%
41.61111111 | 1 | 0.6%
Other values (151) | 151 | 91.5%

Minimum 5 values

Value | Count | Frequency (%)
15.5 | 1 | 0.6%
25.33333333 | 1 | 0.6%
29 | 1 | 0.6%
36 | 1 | 0.6%
40.12 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
334.8888889 | 1 | 0.6%
259 | 1 | 0.6%
246.4 | 1 | 0.6%
235.375 | 1 | 0.6%
214.9 | 1 | 0.6%

sessions_per_day
Real number (ℝ≥0)

Distinct count: 156
Unique (%): 94.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.23248584287136917
Minimum: 0.023809523809523808
Maximum: 1.2459016393442623
Zeros: 0
Zeros (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 0.02380952381
5-th percentile: 0.04685714286
Q1: 0.09677419355
median: 0.1690140845
Q3: 0.2857142857
95-th percentile: 0.6228571429
Maximum: 1.245901639
Range: 1.222092116
Interquartile range (IQR): 0.1889400922

Descriptive statistics

Standard deviation: 0.2096834475
Coefficient of variation (CV): 0.901919209
Kurtosis: 6.21988112
Mean: 0.2324858429
Median Absolute Deviation (MAD): 0.08568075117
Skewness: 2.241243262
Sum: 38.36016407
Variance: 0.04396714816

[Figure: histogram with fixed size bins (bins=10)]

Common values

Value | Count | Frequency (%)
0.1470588235 | 3 | 1.8%
0.323943662 | 2 | 1.2%
0.109375 | 2 | 1.2%
0.1176470588 | 2 | 1.2%
0.07692307692 | 2 | 1.2%
0.2 | 2 | 1.2%
0.1066666667 | 2 | 1.2%
0.07407407407 | 2 | 1.2%
0.1379310345 | 1 | 0.6%
0.08108108108 | 1 | 0.6%
Other values (146) | 146 | 88.5%

Minimum 5 values

Value | Count | Frequency (%)
0.02380952381 | 1 | 0.6%
0.0243902439 | 1 | 0.6%
0.025 | 1 | 0.6%
0.02631578947 | 1 | 0.6%
0.02962962963 | 1 | 0.6%

Maximum 5 values

Value | Count | Frequency (%)
1.245901639 | 1 | 0.6%
1.134328358 | 1 | 0.6%
1.036144578 | 1 | 0.6%
0.8875 | 1 | 0.6%
0.8513513514 | 1 | 0.6%

cancelled
Boolean

Distinct count: 2
Unique (%): 1.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.3 KiB

Value | Count | Frequency (%)
0 | 129 | 78.2%
1 | 36 | 21.8%

Interactions

Correlations

Pearson's r

Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
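
The calculation described above can be sketched in plain Python as follows. The function name `pearson_r` is illustrative, and the input values are the page_count and songs_played columns of the first five sample rows listed in this report; this is a minimal sketch, not the code pandas-profiling itself runs.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Covariance of xs and ys divided by the product of their standard deviations."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sample covariance and standard deviations; the (n - 1) factors cancel in r.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)
    std_x = sqrt(sum((x - mean_x) ** 2 for x in xs) / (n - 1))
    std_y = sqrt(sum((y - mean_y) ** 2 for y in ys) / (n - 1))
    return cov / (std_x * std_y)


# First five sample rows of this report.
page_count = [795, 3214, 218, 1245, 520]
songs_played = [673, 2682, 195, 942, 423]

r_sample = pearson_r(page_count, songs_played)
print(r_sample)

# Changing location and scale of one variable leaves r unchanged.
r_rescaled = pearson_r([2 * x + 3 for x in page_count], songs_played)
print(r_rescaled)
```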

Spearman's ρ

Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and +1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
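
A minimal sketch of this rank-based calculation is given below. The helper names `average_ranks`, `pearson_r` and `spearman_rho` are illustrative, and ties are assumed to receive the average of the ranks they span, which is a common convention but not stated in this report.

```python
from math import sqrt


def average_ranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    return ranks


def pearson_r(xs, ys):
    """Covariance divided by the product of the standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def spearman_rho(xs, ys):
    """Pearson's r computed on the rank variables of xs and ys."""
    return pearson_r(average_ranks(xs), average_ranks(ys))


# A monotonic but nonlinear relationship: rho is 1 while r is below 1.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]
print(spearman_rho(xs, ys), pearson_r(xs, ys))
```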

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation and +1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
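
The pair counting can be sketched as below. This implements the plain form described in the text (often called tau-a), in which tied pairs count as neither concordant nor discordant; libraries such as SciPy and pandas apply a tie correction (tau-b) by default, so their results may differ when the data contain ties. The function name `kendall_tau_a` is illustrative.

```python
from itertools import combinations


def kendall_tau_a(xs, ys):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs


# Three observations: two concordant pairs and one discordant pair.
tau_example = kendall_tau_a([1, 2, 3], [1, 3, 2])
print(tau_example)
```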

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available from the phik project.

Missing values

Sample

First rows

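row | df_index | page_count | songs_played | songs_playlisted | thumbed_up | thumbed_down | added_friends | top_location | user_gender | error_count | redirect_count | days_active | 7d_songs_played | 7d_songs_playlisted | 7d_thumbed_up | 7d_thumbed_down | 7d_added_friends | playlisted_ratio | thumbed_up_ratio | thumbed_down_ratio | songs_per_day | interactions_per_day | time_mean | session_count | inter_per_session | sessions_per_day | cancelled
0 | 0 | 795 | 673.0 | 9.0 | 37.0 | 4.0 | 12.0 | 51 | 1 | 795 | 795 | 51 | 151.0 | 1.0 | 4.0 | 0.0 | 2.0 | 0.013353 | 0.054896 | 0.005935 | 12.942308 | 15.288462 | 459 | 6 | 132.500000 | 0.117647 | 0
1 | 1 | 3214 | 2682.0 | 61.0 | 148.0 | 27.0 | 49.0 | 26 | 1 | 3214 | 3214 | 64 | 284.0 | 4.0 | 20.0 | 4.0 | 4.0 | 0.022736 | 0.055162 | 0.010063 | 41.261538 | 49.446154 | 316 | 35 | 91.828571 | 0.546875 | 0
2 | 3 | 218 | 195.0 | 5.0 | 5.0 | 0.0 | 1.0 | 96 | 0 | 218 | 218 | 160 | 1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.025510 | 0.025510 | 0.000000 | 1.211180 | 1.354037 | 200 | 4 | 54.500000 | 0.025000 | 0
3 | 5 | 1245 | 942.0 | 23.0 | 35.0 | 11.0 | 19.0 | 104 | 0 | 1245 | 1245 | 172 | 331.0 | 7.0 | 10.0 | 4.0 | 4.0 | 0.024390 | 0.037116 | 0.011665 | 5.445087 | 7.196532 | 185 | 21 | 59.285714 | 0.122093 | 0
4 | 8 | 520 | 423.0 | 9.0 | 19.0 | 6.0 | 17.0 | 53 | 0 | 520 | 520 | 115 | 174.0 | 4.0 | 10.0 | 2.0 | 10.0 | 0.021226 | 0.044811 | 0.014151 | 3.646552 | 4.482759 | 188 | 9 | 57.777778 | 0.078261 | 1
5 | 9 | 940 | 772.0 | 30.0 | 37.0 | 6.0 | 17.0 | 56 | 0 | 940 | 940 | 68 | 128.0 | 1.0 | 4.0 | 0.0 | 5.0 | 0.038810 | 0.047865 | 0.007762 | 11.188406 | 13.623188 | 527 | 6 | 156.666667 | 0.088235 | 0
6 | 10 | 671 | 518.0 | 12.0 | 23.0 | 8.0 | 7.0 | 94 | 1 | 671 | 671 | 37 | 187.0 | 3.0 | 13.0 | 5.0 | 2.0 | 0.023121 | 0.044316 | 0.015414 | 13.631579 | 17.657895 | 211 | 10 | 67.100000 | 0.270270 | 1
7 | 13 | 600 | 476.0 | 12.0 | 18.0 | 9.0 | 2.0 | 18 | 1 | 600 | 600 | 43 | 186.0 | 6.0 | 9.0 | 5.0 | 1.0 | 0.025157 | 0.037736 | 0.018868 | 10.818182 | 13.636364 | 271 | 7 | 85.714286 | 0.162791 | 1
8 | 14 | 1392 | 1131.0 | 31.0 | 39.0 | 15.0 | 28.0 | 20 | 0 | 1392 | 1392 | 44 | 155.0 | 2.0 | 12.0 | 2.0 | 3.0 | 0.027385 | 0.034452 | 0.013251 | 25.133333 | 30.933333 | 329 | 14 | 99.428571 | 0.318182 | 1
9 | 15 | 310 | 257.0 | 7.0 | 17.0 | 3.0 | 6.0 | 72 | 1 | 310 | 310 | 85 | 60.0 | 2.0 | 5.0 | 1.0 | 1.0 | 0.027132 | 0.065891 | 0.011628 | 2.988372 | 3.604651 | 184 | 6 | 51.666667 | 0.070588 | 1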

Last rows

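row | df_index | page_count | songs_played | songs_playlisted | thumbed_up | thumbed_down | added_friends | top_location | user_gender | error_count | redirect_count | days_active | 7d_songs_played | 7d_songs_playlisted | 7d_thumbed_up | 7d_thumbed_down | 7d_added_friends | playlisted_ratio | thumbed_up_ratio | thumbed_down_ratio | songs_per_day | interactions_per_day | time_mean | session_count | inter_per_session | sessions_per_day | cancelled
155 | 213 | 801 | 667.0 | 12.0 | 30.0 | 6.0 | 14.0 | 5 | 1 | 801 | 801 | 69 | 33.0 | 0.0 | 2.0 | 0.0 | 2.0 | 0.017964 | 0.044910 | 0.008982 | 9.528571 | 11.442857 | 550 | 5 | 160.200000 | 0.072464 | 0
156 | 214 | 3191 | 2676.0 | 77.0 | 118.0 | 32.0 | 40.0 | 12 | 1 | 3191 | 3191 | 60 | 480.0 | 11.0 | 18.0 | 3.0 | 8.0 | 0.028764 | 0.044079 | 0.011954 | 43.868852 | 52.311475 | 352 | 31 | 102.935484 | 0.516667 | 0
157 | 216 | 3014 | 2580.0 | 64.0 | 124.0 | 24.0 | 42.0 | 70 | 1 | 3014 | 3014 | 116 | 495.0 | 10.0 | 37.0 | 4.0 | 3.0 | 0.024797 | 0.048043 | 0.009299 | 22.051282 | 25.760684 | 1179 | 9 | 334.888889 | 0.077586 | 0
158 | 217 | 7230 | 5945.0 | 181.0 | 292.0 | 72.0 | 110.0 | 12 | 0 | 7230 | 7230 | 83 | 337.0 | 6.0 | 15.0 | 3.0 | 12.0 | 0.030441 | 0.049109 | 0.012109 | 70.773810 | 86.071429 | 284 | 86 | 84.069767 | 1.036145 | 0
159 | 218 | 815 | 640.0 | 17.0 | 35.0 | 4.0 | 13.0 | 3 | 1 | 815 | 815 | 71 | 350.0 | 12.0 | 19.0 | 3.0 | 8.0 | 0.026521 | 0.054602 | 0.006240 | 8.888889 | 11.319444 | 216 | 12 | 67.916667 | 0.169014 | 0
160 | 220 | 2091 | 1694.0 | 46.0 | 91.0 | 25.0 | 32.0 | 79 | 0 | 2091 | 2091 | 62 | 93.0 | 2.0 | 5.0 | 0.0 | 0.0 | 0.027139 | 0.053687 | 0.014749 | 26.888889 | 33.190476 | 211 | 33 | 63.363636 | 0.532258 | 0
161 | 221 | 2176 | 1802.0 | 52.0 | 92.0 | 24.0 | 40.0 | 6 | 0 | 2176 | 2176 | 73 | 641.0 | 14.0 | 35.0 | 7.0 | 13.0 | 0.028841 | 0.051026 | 0.013311 | 24.351351 | 29.405405 | 391 | 19 | 114.526316 | 0.260274 | 0
162 | 222 | 2404 | 1975.0 | 61.0 | 108.0 | 12.0 | 32.0 | 88 | 0 | 2404 | 2404 | 87 | 291.0 | 6.0 | 20.0 | 2.0 | 3.0 | 0.030870 | 0.054656 | 0.006073 | 22.443182 | 27.318182 | 267 | 30 | 80.133333 | 0.344828 | 0
163 | 223 | 2891 | 2401.0 | 58.0 | 115.0 | 22.0 | 45.0 | 41 | 1 | 2891 | 2891 | 63 | 234.0 | 5.0 | 6.0 | 2.0 | 5.0 | 0.024147 | 0.047877 | 0.009159 | 37.515625 | 45.171875 | 370 | 28 | 103.250000 | 0.444444 | 0
164 | 224 | 614 | 505.0 | 11.0 | 27.0 | 5.0 | 12.0 | 70 | 0 | 614 | 614 | 133 | 303.0 | 9.0 | 18.0 | 2.0 | 7.0 | 0.021739 | 0.053360 | 0.009881 | 3.768657 | 4.582090 | 190 | 11 | 55.818182 | 0.082707 | 0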